arXiv: 1507.08384vl [cs.DS] 30M2015 


The Submodular Secretary Problem Goes Linear 


Moran Feldman* Rico Zcnklusen ' 

July 31, 2015 


Abstract 

During the last decade, the matroid secretary problem (MSP) became one of the most prominent 
classes of online selection problems. The interest in MSP is twofold: on the one hand, there are many 
interesting applications of MSP; and on the other hand, there is strong hope that MSP admits 0(1)- 
competitive algorithms, which is the claim of the well-known matroid secretary conjecture. Partially 
linked to its numerous applications in mechanism design, substantial interest arose also in the study 
of nonlinear versions of MSP, with a focus on the submodular matroid secretary problem (SMSP). 
The fact that submodularity captures the property of diminishing returns, a very natural property for 
valuation functions, is a key reason for the interest in SMSP. So far, 0(l)-competitive algorithms have 
been obtained for SMSP over some basic matroid classes. This created some hope that, analogously 
to the matroid secretary conjecture, one may even obtain 0(1)-competitive algorithms for SMSP over 
any matroid. However, up to now, most questions related to SMSP remained open, including whether 
SMSP may be substantially more difficult than MSP; and more generally, to what extend MSP and 
SMSP are related. 

Our goal is to address these points by presenting general black-box reductions from SMSP to MSP. 
In particular, we show that any 0(l)-competitive algorithm for MSP, even restricted to a particular ma¬ 
troid class, can be transformed in a black-box way to an 0(l)-competitive algorithm for SMSP over 
the same matroid class. This implies that the matroid secretary conjecture is equivalent to the same con¬ 
jecture for SMSP. Hence, in this sense SMSP is not harder than MSP. Also, to find 0(l)-competitive 
algorithms for SMSP over a particular matroid class, it suffices to consider MSP over the same ma¬ 
troid class. Using our reductions we obtain many first and improved (){1 )-competitive algorithms for 
SMSP over various matroid classes by leveraging known algorithms for MSP. Moreover, our reductions 
imply an 0(log log(rank))-competitive algorithm for SMSP, thus, matching the currently best asymp¬ 
totic algorithm for MSP, and substantially improving on the previously best O (log (rank))-competitive 
algorithm for SMSP. 


Keywords: matroids, online algorithms, secretary problem, submodular functions 


‘School of Computer and Communication Sciences, EPFL. Email: moran.feldman@epfl.ch Supported by ERC Starting Grant 
335288-OptApprox. 

^ Department of Mathematics, ETH Zurich, and Department of Applied Mathematics and Statistics, Johns Hopkins University. 
Email: ricoz@math.ethz.ch 



1 Introduction 


Secretary problems are a very natural class of online selection problems with many interesting applications. 
The origin of the secretary problem is hard to track and dates back to at least the '60s fl2l UTS, 20.12 1 . 301. 
In its original form, also called the classical secretary problem, the task is to hire the best secretary out 
of a set E of candidates of known size n = \E\. Secretaries get interviewed (or appear ) one by one 
in a random order. All secretaries that appeared so far can be compared against each other according to 
an underlying linear ordering. Whenever a secretary got interviewed, one has to decide immediately and 
irrevocably whether to hire (or select) him. The task is to hire the best secretary with as high a probability as 
possible. Dynkin IIT21 provided an asympotically optimal algorithm for this problem, which hires the best 
secretary with probability at least 1/e. The classical secretary problem is naturally interpreted as a stopping 
time problem and, not surprisingly, was mostly studied by probabilists. 

During the last decade, interest in generalized versions of the classical secretary problem surged. One 
reason for this is a variety of applications in mechanism design (see Iflll3ll4ll26l and the references therein). 
These generalizations allow hiring of more than one secretary, subject to a given set of (down-closed) con¬ 
straints. Each secretary reveals a non-negative weight at appearance, and the task is to hire a maximum 
weight set of secretaries. The arguably most canonical generalization is the problem of hiring k out of n 
secretaries instead of a single one (see l26l ). However, more general constraints are required for many 
interesting applications. 

A considerably more general setting, known as the matroid secretary problem (MSP for short) and 
introduced in 01, allows for selecting a subset of E that is independent in a given matroid M. = (F. Z({j] 
Similar to the classical secretary problem, the number n = \E\ of candidates, or elements, is known upfront, 
elements appear in random order, and no assumption is made on their weights. Access to the matroid A4 is 
provided by an independence oracle that can be called on appeared elements, i.e., for any subset S C E of 
elements that appeared so far, one can check whether S € T or not. 

MSP attracted considerable interest recently. It is very appealing due to the fact that it captures a wide 
set of interesting selection problems in a single framework. Furthermore, matroids are highly structured con¬ 
straints, which gives reasonable hope that strong online algorithms exist. Indeed, there is a famous conjec¬ 
ture, which we simply call the matroid secretary conjecture, claiming the existence of an 0(1 (-competitive 
algorithm for the matroid secretary problem 0. We recall that an algorithm is a-competitive if the expected 
weight collected by the algorithm is at least ^ • m(OPT), where m(OPT) is the maximum weight of any 
feasible set, i.e., the offline optimum. Whereas this conjecture remains open, 0(1 (-competitive algorithms 
have been obtained for various special cases of matroids (see 0l9l fTTll24ll25ll27ll32ll36l ). The currently 
strongest asymptotic competitive ratio obtained for general MSP —without any restriction on the underlying 
matroid—is O (log log (rank)) ll28llT6l . where “rank” is the rank of M., i.e., the cardinality of a maximum 
cardinality independent set in M. 

Recently, increased interest arose in nonlinear versions of the secretary problem, with a focus on the 
maximization of a non-negative submodular functiorU E 0 M M El, leading to the submodular secretary 
problem. Submodular functions have widespread use as valuation functions because they reflect the property 
of diminishing returns, i.e., the marginal value of an element is the bigger the fewer elements have been 
selected so far. This makes them natural candidates for the matroid secretary setting. 

Additionally, submodular weight functions capture further generalizations of the secretary problem. For 

1 A matroid M = (E, X) consists of a finite set E and a non-empty family I C 2® of subsets of E, called independent sets 
that satisfy: (i) if I £ X and J C I then J £ X, and (ii) if I, J £ X with |/| > | J\ then there is an element e £ I \ J such that 
J U {e} £ X. For further basic matroidal concepts, such as rank and span, we refer to [|35l Volume B], 

2 A non-negative submodular function / on a ground set E is a function /: 2 E —> R + giving a non-negative weight to every 
subset of E and satisfying the following diminishing returns property: f(A U{e}) — f{A) > /(BU (e}( — f(B) for A C B C E 
and e £ E\B. 
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example, Gilbert and Mosteller f22ll and Freeman fl9Tl considered a variation of the classical secretary 
problem where one can select k elements with the goal of maximizing the value of the highest-valued 
element. This problem can be phrased as a submodular secretary problem with the submodular function 
/ : 2 e —> R + defined by f(S) = max{w(e) | e G S} for S C E, where w{e) is the weight revealed by 
element e. 

The currently best asymptotic competitive ratio for the submodular matroid secretary problem (SMSP) 
is O (log (rank)) l23l . Furthermore, ()( 1 )-competitivc algorithms have been obtained for special classes of 
matroids, including uniform matroids |6l fl5l 1231. partition matroids llT5l l23l . and laminar and transversal 
matroids i32l (both only for monotone^ submodular functions). 

In general, our understanding of secretary problems is much more limited when dealing with submodular 
weights instead of linear ones, leading to many open questions. In particular, is there hope to get an 0(1)- 
competitive algorithm for SMSP? Notice that this corresponds to the matroid secretary conjecture translated 
to the submodular case. Or may the submodular case be substantially harder than the linear one? Do 
monotone SMSP admit considerably better competitive ratios than nonmonotone ones? Can we leverage 
strong algorithms for MSP to obtain results for SMSP? 

The goal if this paper is to address these questions and get a deeper understanding of SMSP and its 
relation to MSP, independently of the structure of particular classes of underlying matroids. 

1.1 Our Results 

Our main result below shows an intimate relation between the (linear) matroid secretary problem and the 
submodular version. More precisely, we show that one can use any algorithm for MSP as a black box to 
obtain an algorithm for SMSP with a slightly weaker competitive ratio. 

Theorem 1.1. Given an arbitrary algorithm Linear for MSP and a value a. > 1, there exists an algorithm 
for SMSP whose competitive ratio is at most 24a(3a + 1) = O(a 2 ) for every matroid M. on which Linear 
is guaranteed to be at least a-competitive. 

Theorem ll.ll has several interesting implications. In particular, if there is an 0(l)-competitive algorithm 
for the linear case, then there is an 0(l)-competitive algorithm for the submodular case. Hence, the matroid 
secretary conjecture is equivalent to the same statement for the submodular version. This provides strong 
hope that constant-competitive algorithms exist for SMSP. 

Furthermore, Theorem 11.11 implies many new results for SMSP, both for the general version with¬ 
out any restriction on the matroid as well as for many special classes of matroids, by leveraging algo¬ 
rithms for MSP. In particular, the known O (log log (rank) )-competitive algorithms for MSP 128 . 161 imply 
0((log log (rank) ) 2 )-competitive algorithms for SMSP, which already considerably improves on the previ¬ 
ously best O (log(rank))-competitive algorithm for SMSP ll23l . We later strengthen this result to match the 
asymptotically best algorithm for MSP. 

The only matroid classes for which 0(l)-competitive algorithms for SMSP have been explicitly given, 
without assuming monotonicity of the submodular weight function, are uniform matroids J6lj231> unitary 
partition matroids l23l and matroids for which a reduction to unitary partition matroids is known. Such 
reductions are known for graphic matroids [[2] [23]], cographic matroids ll36l . and for max-flow min-cut ma¬ 
troids tm . These reductions have originally been used to obtain 0(l)-competitive algorithms for MSP over 
these matroids, but they lead also to algorithms for SMSP over the same matroid classes when combined 
with an algorithm for SMSP over unitary partition matroids. For other classes of matroids, Theorem ll.il 
implies the first 0(1 (-competitive algorithm for SMSP, by leveraging known 0(l)-competitive algorithms 
for MSP, such as the ones known for transversal matroids lf27l and laminar matroids |[24l . Furthermore, we 

3 A submodular function is monotone if S C T C E implies f(S) < f(T). 
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also improve the competitive ratios for most matroid classes for which 0(l)-competitive algorithms have 
already been known. 

The analysis of the algorithm that we use to prove Theorem ll.ll can easily be improved in many cases, if 
the algorithm Linear obeys some natural properties. Theorem [L2]below gives a first strengthening of Theo- 
rem ll.ll and allows us to derive an 0(log log (rank) )-algorithm for SMSP, thus, matching the currently best 
algorithm for MSP up to a constant factor, and improving on the previously best 0(log(rank)) competitive 
algorithm for SMSP. 

We highlight that Theorem ll.ll can be obtained from Theorem 1 1.2l bv setting k = 1. 

Theorem 1.2. Given an algorithm Linear for MSP and value a > I, there exists an algorithm for SMSP 
whose competitive ratio is at most 24k a (3 a + 1) = 0(k ■ a 2 ) for every matroid Ad on which the behavior 
of Linear can be characterized as follows. 

• For every MSP instance over the matroid AT there exists a set ofk (correlated) random sets {P, }^ =1 
such that each set Pi is always independent in Ad and E[u?(U^ =1 Pi)] > m(OPT w )/a, where w is 
the weight function of the MSP instance and OPT,,, is the maximum weight independent set given w. 

• Linear outputs a uniformly random set from { P, }(' = |. 

Observe that such an algorithm Linear is (k ■ a)-competitive for AT 

Corollary 1.3. There exists an O (log log (rank) )-competitive algorithm for SMSP. 

Proof. Feldman et al. lfl6l describe an 0(log log (rank))-competitive algorithm for MSP. Their analysis of 
the algorithm shows that in fact it obeys the requirements of Theorem 1 1.2 1 for Linear with a = 0(1) and 
k = O (log log (rank)). Notice that the algorithm guaranteed by Theorem 1 1.21 does not depend on k. Hence, 
there is no need to know the rank ahead of time to use this algorithm. □ 

In what follows we present further useful strengthenings of Theorem 11.11 that are readily obtained 
through our derivation of Theorem 11.11 These results allow for obtaining stronger competitive ratios if 
Linear fulfills some very typical properties, or if the submodular valuation function is monotone. When 
dealing with monotone submodular functions, we talk about the monotone submodular secretary problem 
(MSMSP). We first state the improved reductions and later present their implications in terms of competitive 
ratios in the form of a table. 

Theorem 1.4. Given an algorithm Linear for MSP and values a > 1 and q € (0,1], there exists an 
algorithm for SMSP whose competitive ratio is at least 24a(3qa + 1) = ()(q ■ a 2 ) for every matroid Ad 
on which Linear is guaranteed to: 

• be at least a-competitive. 

• select every element with probability at most q. 

The results proved above can be somewhat improved for monotone objective functions. Appendix [A] 
explains how the proofs of Theorems 11.1111.21 and 11.41 need to be changed in order to get the following 
theorem. 

Theorem 1.5. When SMSP is replaced with MSMSP in Theorems I/. /I I/.21 and \I,4\ their guarantees can 
be improved to 8a(a + 1), 8 ka(a + 1), and 8 a(aq + 1), respectively. 

For monotone functions there is one additional natural property of Linear that can be used to get a 
stronger result. Intuitively this property is that Linear selects every element of the optimal solution with 
probability at least a -1 , and thus, is a-competitive. Many algorithms have this property when items have 
disjoint weights, and thus, there is a single optimal solution. Such algorithms are often extended to general 
inputs by introducing a random tie breaking rule. The following theorem is designed to deal with algorithms 
obtained this way; its proof can be found in Appendix lAl 
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Theorem 1.6. Given an arbitrary algorithm Linear for MSP and a value a > 1, there exists an algorithm 
for MSMSP whose competitive ratio is at most 16a = 0(a) for every matroid A4 on which Linear has 
the following property: For every MSP instance over the matroid Ai = (J\T, X), there exists a random set 
S C E obeying: 

• S is always an optimal solution of the MSP instance. 

• For every element u £ E, Pr[u is selected by Linear] > Pr[u £ S]/a. 

The following table summarizes the competitive ratios we obtain, for SMSP and MSMSP over partic¬ 
ular matroid classes, by leveraging the above-presented reductions^] A straightforward application of these 
reductions leads to improvements or even first results for all matroid classes listed in the table. However, 
using some additional observations we can sometimes get further improvements. Further details on how the 
stated results are implied by our reductions can be found in Appendix |B] Our improvement for unitary par¬ 
tition matroids also implies improvements by the same factor for all matroid classes for which a reduction 
to unitary partition matroids is known. 


Matroid Type 

Known Competitive Ratio 

Our Competitive Ratio 

Unitary partition matroids 

1297 W E3 

261 

Transversal matroids 

— 

2496 

/c-sparse linear matroids - 2 - 1 

— 

24/ce(3/c + 1) 

/c-sparse linear matroids (MSMSP) 

- 

8 ke(k + 1) 

Laminar matroids 

— 

585 

Laminar matroids (MSMSP) 

211 (321 

144 


Gupta et al. 1231 do not explicitly calculate the competitive ratio of their algorithm, however, its 
competitive ratio is no better than 48000/37 ~ 1297. 

A /c-sparse linear matroid is a linear matroid with a matrix representation using at most k non¬ 
zeros per column. 

We highlight that apart from the unitary partition matroid case, all other results in the above table for 
particular matroid classes assume prior knowledge of the matroid, in addition to the size of its ground set. 
This is due to the fact that the corresponding MSP algorithms that we put into our framework to get results 
for SMSP make this assumption. 

Remark: The proofs of all the above theorems use Linear in a black-box manner. Hence, these theorems 
apply also to many models allowing the algorithm more information, such as the model in which the al¬ 
gorithm has full knowledge about the matroid from the beginning (but not about the objective function). 
Also, we emphasize that all the above results are general reductions from MSP to SMSP, not assuming any 
particular structure about the underlying matroid. This is in stark contrast to almost all results on SMSP so 
far. 

1.2 Further Related Work 

Progress has been made on the matroid secretary conjecture for variants of MSP which modify the assump¬ 
tions on the order in which elements arrive and the way weights are assigned to elements. One simpler 
variant of MSP is obtained by assuming random weight assignment. Here, an adversary can only choose 
n = | E\ not necessarily distinct weights, and the weights are assigned to the elements uniformly at ran¬ 
dom. In this model, a 5.7187-competitive algorithm can be obtained for any matroid 1361 . Additionally, 
a 16(1 — 1/e)-competitive procedure can still be obtained in the random weight assignment model even 
if elements are assumed to arrive in an adversarial order l34l l36ll . Hence, this variant is, in a sense, the 

4 For readability, the competitive ratios in the table have been rounded to the closest integer. 
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opposite of the classical MSP, where weights are adversarial and the arrival order is random. Furthermore, 
a 4-competitive algorithm can be obtained in the so-called free order model. Here, the weight assignment 
is adversarial; however, the algorithm can choose the order in which elements arrive CD ED]. Among the 
above-discussed variants, this is the only variant with adversarial weight assignments for which an 0(1)- 
competitive algorithm is known. For more information on recent advances on MSP and its valiants we refer 
to the survey iflOl . 

We also highlight that SMSP is an online version of submodular function maximization (SFM) over a 
matroid constraint. Interestingly, even in the offline setting, 0(l)-approximations for SFM over a matroid 
constraint have only been discovered very recently, starting with a (4 + e)-approximation presented in |29l . 
Considerable progress has been made in the meantime ll34l fl5ll . The currently strongest approximation 
algorithm has an approximation ratio of about e ~ 2.718 lfl5l . We refer the interested reader to |j8j [15] for 
more information on constrained SFM. 

1.3 Organization of the paper 

We start by formally introducing our problem and some basic notation and results in Section [2j Section [3] 
presents our main algorithm that we use to prove our results. Finally, Section 0] provides details on the 
analysis of our algorithm. 

2 Preliminaries 

In this section we formally define our problem and state some notation and known results that are used later 
in the paper. 

2.1 Problems and Standard Notation 

An instance of the Submodular Matroid Secretary Problem (SMSP) consists of a ground set E, a non¬ 
negative submodular objective /: E —» M + and a matroid constraint A4 = (E.Z). An algorithm for 
this problem faces the elements of E in a uniformly random order, and must accept or reject each element 
immediately upon arrival. The algorithm has access to n = \E\ and two oracles: 

• A value oracle that, given a subset S C E of elements that already arrived , returns /(.S'). 

• An independence oracle that, given a subset S C E of elements that already arrived , determines 
whether S £ X. 

The objective of the algorithm is to accept an independent set of elements maximizing /. 

The Matroid Secretary Problem (MSP) can be viewed as a restriction of SMSP to linear objective 
functions. More formally, an instance of MSP is an instance of SMSP in which the function f(S) is 
defined by f(S) = w ( u ) for some set of non-negative weights {w(u) \ u £ Ej. Similarly, the 

Monotone Submodular Matroid Secretary Problem (MSMSP) is a restriction of SMSP to non-negative 
monotone submodular objective functions. 

The following notation comes handy in our proofs. Given a set S and an element u, we use S + u 
and S — u to denote the sets S U { u } and S \ {«}, respectively. Additionally, given a weight function 
w : E — y M + , we use w(S) as a shorthand for w(u). Finally, given a set function f: 2 E — > M, we 

denote by /(it | S ) the marginal contribution of adding u to S. More formally, /(it | S) := f(S+u) — f(S). 
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2.2 The Function f w 

Given a set function f: 2 E —>• R and a weight vector w £ W E , let f w : 2 E —>• R be the function defined as: 

f w (S) = min{/(A) + w(S \ A)} VS C E . 

This construction of f w out of / and w is well known in the field of submodular function optimization, and 
is sometimes called convolution (see, for example, lOTI ). We state some basic properties of f w . Proofs of 
these results can, for example, be found in lf33l Chapter 10]. 

Property 2.1. For every set S C K, f(S) > f w (S), and for S = 0 the inequality holds with equality. 

Property 2.2. If f is a non-negative submodular function and w is non-negative, then f w is also non¬ 
negative and submodular. Moreover, if f is also monotone, then so is f w . 

2.3 Known Lemmata 

We need the following known lemmata. The lemmata have been rephrased a bit to make the difference 
between them clearer. 

Lemma 2.3 (Lemma 2.2 of Il3l ). Let g: 2 s —R + be a submodular function. Denote by A(p ) a random 
subset of A where each element appears with probability p (not necessarily independently). Then, 

Eb(A(p))] > (1 -p)g(0) +p-g{A) . 

Lemma 2.4 (Lemma 2.2 of Q). Letg : 2 E ->R+be a non-negative submodular function. Denote by A{p) 
a random subset of A where each element appears with probability at most p (not necessarily independently). 
Then, 

E b(A(p))] > (1 -p)g(0) ■ 


3 Algorithm 

The algorithm used to prove our results for SMSP is given as Algorithm [Q Observe that the algorithm has 
a single probability parameter p £ (0,1). Additionally, the algorithm uses an arbitrary procedure Linear for 
MSP whose existence is assumed by Theorem 11.11 and our other results for SMSP. Finally, the algorithm 
also uses as a subroutine the standard greedy algorithm for maximizing a submodular function subject to 
matroid constraints (denoted by Greedy), which can be found, for completeness, in Appendix [Cpl While 
reading the description of Algorithm [U the only important thing one has to know about Greedy is that it 
creates its solution by starting with the empty set and adding elements to it one by one. 

A key challenge in trying to leverage Greedy in an algorithm for SMSP, is that Greedy is not a 
constant-factor approximation algorithm for submodular function maximization over a matroid (or even in 
the unconstrained setting). In our analysis we show that Algorithm Q] manages to circumvent this issue, 
and the set M, produced within Algorithm [2 using Greedy, does provide a constant approximation for the 
optimal solution. 

Observe that Algorithm |T] can be implemented online because whenever an element u is fed to Linear, 
the algorithm already knows whether u £ N, and thus, can determine the membership of u in Q n N 
immediately after Linear determines the membership of u in Q. Additionally, note that Algorithm |T] applies 

5 For non-monotone functions there are two versions of Greedy: one that stops picking elements once they all have negative 
marginal contributions, and one that continues as long as possible. In this paper we use the first version. 
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Algorithm 1: Online (p) 

// Learning Phase 

1 Choose X from the binomial distribution B(n, 1/2). 

2 Observe (and reject) the first X elements of the input. Let L be the set of these elements. 


// Selection Phase 

3 Let M be the output of Greedy on the set L. 

4 Let N 4— 0. 

5 for each arriving element u G E \ L do 

6 Let w(u) 0. 

7 if u is accepted by Greedy when applied to M + u then 

8 with probability p do 

9 Add u to N. 

to Let M u C M be the solution of Greedy immediately before it adds u to it. 

it Update w(u) 4— f(u \ M u ). 


12 


Pass u to Linear with weight w(u). 


13 return Q d N, where Q is the output of Linear. 


Linear to the restriction of its input matroid At to the set E\L. We assume in the analysis of AlgorithmQ] 
that Linear is a-competitive for that restriction of At whenever it is a-competitive for At. Many algorithms 
for MSP obey this property without any modifications, but for some we need the following proposition 
which proves that this assumption can be justified in general. 

Definition 3.1. Partial-MSP is a variant of MSP where an instance consists also of a set L C E known to 
the algorithm. The elements of E\L arrive at a uniformly random order as usual. The elements of L never 
arrive, and thus, cannot be added to the solution. However, the oracles can answer queries about them as if 
they arrived before the first element of E\L. 

Proposition 3.2. Given an algorithm Li near for MSP, there exists an algorithm for Partial-MSP whose 
competitive ratio for every matroid At is as good as the competitive ratio of Linear for this matroid. 

The proof of Proposition 13. 2l can be found in Appendix iDl Notice that Linear indeed faces an instance 
of Partial-MSP since the set L is fully known before Linear is invoked for the first time, and the weights 
assigned to elements depend solely on L. The following simple observation is well known. A proof of it 
can be found, e.g., as Lemma A.l. of lfl6l . 

Observation 3.3. The set L constructed by Algorithm \T]contains every element of E with probability 1/2, 
independently. 

As is, it is difficult to prove some claims about Algorithm Q] For that purpose, we present Algorithm [2] 
which is an offline algorithm sharing the same joint distribution (as Algorithm Q} of the set M and the output 
set (a justification of this claim can be found in Appendix O. Clearly, the competitive ratio of Algorithm |T| 
is equal to the approximation ratio of Algorithm [2] 

Notice that Algorithm [2] also makes Linear face an instance of Partial-MSP (with L = E \ (N U No)). 
Hence, by Proposition 13.21 we can assume the competitive ratio of Linear for At extends to the instance it 
faces in Algorithm [2] 

6 The restriction of a matroid M = (E,I) to a subset E' C E of its ground set is the matroid M' = (E',T fl 2 s ). 
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Algorithm 2: Simulated(p) 

// Initialization 

1 Let M, N, N 0 <- 0. 

2 Let E' <— E. 


3 

4 

5 

6 

7 

8 
9 


// Main Loop 

while E' / 0 do 

Let u be the element of FA maximizing f(u \ M ), and remove u from E'. 

Let w(u) f(u | M). 

if M + u is independent in Ad and w(u ) > 0 then 

with probability 1/2 do Add u to M. otherwise with probability p do Add u to N. 
otherwise 

Update w(u) «— 0. 

Add u to A/). 


to 

it 

12 


else 


Let w(u) -t— 0. 

Add u to Nq with probability 1/2. 


13 Run Linear with N U A r o as the input (in a uniformly random order) and the weights defined by w. 

14 return Q d N, where Q is the output of Linear. 


Algorithms [I] and |2]can be viewed as generalizations of algorithms presented by |[32ll for MSMSP over 
laminar matroids. In particular, the idea of defining a surrogate weight function w based on a subset of 
the elements sampled at the beginning of the algorithm was already used in l i32l . Although our analysis of 
Algorithms |T|and [2] is quite different and much more general than the analysis of lf32l . it does borrow some 
ideas from 11321 . The concept of an offline algorithm simulating a more difficult to analyze online algorithm 
has been previously used, even specifically for offline simulations of an online greedy algorithm (9] [27 , 
[32]. A key novel contribution of our analysis, compared to 1321 . is that we manage to relate the expected 
m-weight of a maximum w>weight independent set in N to E[u;(M)] (see Lemma 14.81 ). Furthermore, 
we overcome several technical hurdles by first comparing values of constructed sets with respect to the 
convoluted submodular function f w instead of the original function /. Finally, our use (and analysis) of a 
modified greedy algorithm allows us to deal with non-monotone submodular functions. 

4 Analysis of Algorithms H and |2] 

Throughout this section (except in Section l4~4b we fix an arbitrary matroid AA = (F.'I) for which Linear 
is o-compctitive, and analyze the approximation ratio of Algorithm [2] for this matroid. Since Algorithms [j] 
and [2] share their output distribution, the approximation ratio we prove for Algorithm [2]implies an identical 
competitive ratio for Algorithm [I] The analysis of the approximation ratio consists of three main stages. In 
each stage we study one of the sets M, N and N n Q. Specifically, we show bounds on the expected values 
assigned to each one of these sets by w and /,„. Notice that once we have a bound on E [f w (N fi Q)], we also 
get a bound on the expected value of the solution produced by Algorithm [2] since f(N n Q) > f w (N n Q) 
by Property 12.1 1^1 

Following is some notation that is useful in many of the proofs below. For every element u £ E, let E u 

7 Observe that the weights w chosen by Algorithm[2]are always non-negative, thus, we can use all the properties of f w stated in 
Section[2] 











be the set of elements processed by Algorithm [2] before u. Then, we define: 


N u = N n E u and 
Similarly, we also define E' u = E u + u and: 

N' u = N n E' u and 


M u = M n E u . 

M'u = M n E' u . 


4.1 First Stage 

We now begin with the first stage of the analysis, i.e., bounding E [w(M)] and E \f w {M)\. The following 
lemma shows that both values arc in fact strongly related to E[/(M)]. 

Lemma 4.1. w(M) + /(0) = f(M) = f w (M). 

Proof. Observe that, by construction, w(u) = f(u \ M u ) for every u E M. Hence, the equality w(M ) + 
/(0) = f(M) holds since: 


w{M) =J2f( u I M “) = /W - /( 0 ) ' 

ugm 

Let us now prove the equality f w (M) = f(M). By Property 12. 11 f w (M) < f(M). Thus, we only need 
to prove the reverse inequality. For every set ACM: 

f(A) + w(M \ A) = /(0) + f(u \ An M u ) + ^2 f( u I M n) 

uGA uGM\A 

> f(<z) + ^2f( u i M u ) + i M “) = > 

uGA uGM\A 

where the inequality follows from submodularity. Hence, by the definition of f w (M ): 

fw(M) = min {/(A) + w(M \ A)} > min /(M) = f(M ) . □ 

The bounds given by the above lemma are in terms of f(M). To make these bounds useful, we need 
to bound also E[/(M)]. This is not trivial since Greedy is not a constant-factor approximation algorithm 
for submodular function maximization over a matroid. Different approaches are known to adapt or extend 
Greedy such that it provides an 0(l)-approximation in the offline setting. However, these are not well- 
suited for the way we simulate greedy online. The next two lemmata show a very simple way to transform 
Greedy into an 0(^-approximation algorithm for submodular function maximization; and most impor¬ 
tantly, this adjustment of Greedy is trivial to simulate online. We are not aware of any previously known 
variation of Greedy that provides an 0(^-approximation for submodular function maximization over a 
matroid constraint, and that can easily be simulated in the online setting. 

The next lemma is similar to Lemma 3 of l23ll . but does not assume that / is normalized (i.e., /(0) = 0). 
The proof of the lemma is deferred to Appendix [D] since it goes along the same lines as the proof of |[23l . 

Lemma 4.2. For a matroid and a non-negative submodular function f, if S is the independent set returned 
by Greedy, then for any independent set C, f(S) > f(C U S)/2. 

The following lemma gives the promised variant of Greedy. 

Lemma 4.3. Let S be a random set containing every element of E with probability 1/2, independently, then 
E[f(Greedy(S))}> f (OPT)/8. 
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Proof. Let T be the output of Greedy. By Lemma l4~2l the following inequality always holds: 


f(T)> 


/(Tu(OPTn5)) 

2 


Hence, 


E [f(T)]= Pr[OPT nS = A] • E[/(T) | OPT r\S = A] 

ACOPT 

>y Pr[OPTn5 = A] -E[/(TUA) | OPTnS = A] . 

ACOPT 


Let ^a(P) = f(T U A). It is easy to check that (),\ is a non-negative submodular function for every 
choice of set A. Additionally, T C S, and thus, contains every element of E \ A with probability at most 
1/2 even conditioned on OPT fl S = A. Hence, by Lemma 1231 

E[/(T U A) | OPT OS' = A] =E \g A (T\A) | OPT OS' = A] > . 

Combining the two last inequalities gives: 

E[/(T)]>I. E (Pr[OPTn S ^].^)^M2^1>m, 

ACOPT ^ ' 

where the last inequality follows by Lemma [231 □ 

Corollary 4.4. E[/(M)] > /(OPT)/8. 

Proof. We prove the corollary for the set M of Algorithm [TJ which is fine since it has the same distribution 
as the set M of Algorithm [2J Observe that the set L of Algorithm Q] contains every element with probability 
1/2, independently, and M is the result of applying Greedy to this set. Thus, the corollary holds by 
Lemma 14731 □ 


4.2 Second Stage 

In the second stage we use the bounds proved in the first stage to get bounds also on E[m(lV)] and E[/,, ; (A r )]. 
Lemma 4.5. E[m(Al)] = p ■ E [w(M)\. 

Proof. Consider an arbitrary element u £ E processed by Algorithm [2j and let us fix all history up to the 
point before u is processed. If M + u f T or f(u \ M) < 0, then there is a zero expected increase in both 
w(M ) and w(N) during the processing of u. Otherwise, the expected increase in w(M) is w(u)/ 2, while 
the expected increase in w(N) is (p/2) ■ w(u ). Hence, if we denote by Am the expected increase in w(M ) 
and by An the expected increase in w(N), then: 

a (p/2) • w(u) A A 

= p ■ a m . 

w(u)/ 2 

The lemma now follows from the linearity of expectation. □ 

Getting a bound on f w ( N ) is somewhat more involved. 

Lemma 4.6. K[f w (N)] > ^ + p -^- • E[/ W (M)]. 
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Proof. Consider the auxiliary function N) = f w (N ) — j/-j j • f w (M U N). Observe that $ is in fact 
a random function since it depends on the random vector w. Our first objective in this proof is to show that 

E[<b(M,iV)] > (l+p)- 1 -f(0) . (1) 

For that purpose, we define A u as the change in + (+/, N) when u is processed by Algorithm [2] More 
formally, A u = <&(M' U1 N' U ) — &(M U , N u ). Additionally, let 11,, be an event encoding all the random 
decisions of Algorithm [2] up to the moment before it processes u, and let 1Z U be the set of all such events. 
Then, since, by Property 12.11 <b(0, 0) = (1 + p)~ l ■ f w (0 ) = (1 + p ) _1 • /(0): 

E[^(AT,A^)]-(l+p)- 1 -/(0) = ^E[A u ] = ^ Y, ( Pr [^«] ' E[A U | R u ]) . 

ueE u&E Ru&Uu 

Thus, to prove Inequality dTJ it is enough to show E[A U | R u \ > 0 for an arbitrary element u € E and event 
R u € TZ U . Notice that conditioned on R u , the sets M u and N u and the part of the vector w corresponding 
to E' u are all deterministic. If M u + u f Z or f(u \ M u ) < 0, then we are done since M u = M' u and 
N u = N' u . Thus, we only need to consider the case M u + u £ X and f(u \ M u ) > 0. In this case u is added 
to M with probability 1/2 and to N with probability p/2. Thus, 

E [f w (<) - U(N u ) | R u ] = Pr[u € N \ R u \ ■ E [f w (u \ N u ) \ R u \ 

> Pr[rt G N | Ru] ■ E [f w (u \ M u U N u ) \ R u \ 

= 1 ■ Pr [ w 6 M U N I Ru] • E [f w (u I M u U N u ) I R u \ 

1/2+ p/2 

= ' HfwiK U K) - f w (M u U N u ) | R u ] , 

where the inequality holds since f w is submodular by Property 12.21 Rearranging the last inequality yields 
E[A U | R u \ > 0, which completes the proof of Inequality dTJ. 

Next, observe that for an element to enter N, three things have to happen: first it must hold that M u +u G 
Z and /(u \ M u ) > 0, then the algorithm must randomly decide not to add the element to M and finally the 
algorithm must randomly decide (with probability p) to add the element to N. The last decision does not 
affect the future development of M and w, and thus, even conditioned on M and w, every element belongs 
to N with probability at most p. To use the last observation, let = f w (M U S ). One can observe 

that g W) M is non-negative and submodular. Thus, by Lemma lA4l 

E [f w (M U N) | M, w] = E[g WiM {N) \ M, w] > (1 - p) ■ g w ,M( 0 ) = (1 - p) ■ fw(M) . 

By the law of total expectation, the above inequality implies: 

E[f w (M UN)] > (l-p)-E[f w (M)] , 

which implies the lemma when combined with Inequality £[]). □ 

4.3 Third Stage 

In the third stage we use the bounds proved in the first and second stages to get bounds on E[m(Q 0 N )] 
and E [f w (Q H N )]. For that purpose, let us define ()PT„;(A r ) as the maximum weight independent set in 
N with respect to the weight function w. The fact that Linear is o-competitive for Ad implies the following 
observation. 

Observation 4.7. E [w(Q FI N )] = E[m(Q)] > p • E[m(OPT^(A r U No))] = p • E[m(OPT„,(A))]. 
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Thus, to get a lower bound on ~K[w(Q n N )] it suffices to get a lower bound on E[ru(OPT^(A r ))]. 
Lemma 4.8. E[w(OPT^(iV))] > . E[w(M)]. 

Proof. For this proof we need Algorithm [2] to maintain two additional sets N' and H. The set N' is the set 
of elements of N that are not spanned by previous IV-elements when added to N. More formally, the set 
N' is originally empty. Whenever an element u is added to N, it is also added to N' if it is not spanned by 
previous elements of N. Clearly, the set N' C N at the end of the procedure is an independent set, and we 
can, thus, use E[m(A r ')] as a lower bound on E[m(OPT lu (iV))]J^] Hence, to prove the lemma it is enough to 
showEHAF)] > ^-K[w{M)\. 

The set H is maintained by the following rules: 

• Originally H is empty. 

• Whenever an element u is added to N', it is also added to H i f H + u G T. 

• Whenever an element u is added to M, it is also added to H. If that addition makes H non- 
independent, then an arbitrary element 0 ( 11 ) G II Gl A r ' such that H — 0{u) G X (such an element 
exists since H \ N' = M is independent) is removed from H. 

Consider now an arbitrary element u G E processed by Algorithm [2J and let us fix all history up to the 
point before u is processed. Notice that at this point w(u) is no longer a random variable. We are interested 
in the expected increase of w(M) and w(N') when u is processed. If M + u X or f(u \ M) < 0, then 
Algorithm [2] does not add u to either M or N, and thus, there is a zero increase in both w(M) and w(N'). 
Otherwise, if N' + u G X, then the expected increase in w(M) is w(u)/ 2, while the expected increase in 
w(N') is (p/2) • w(u). Finally, we need to consider the case that M + u G X and f(u \ M) > 0 but 
N' + u X. In this case the expected increase in w(M) is still w(u)/ 2, but the expected increase in w(N') 
is 0. To fix that, we charge (p/2)-w{u) to the element of H that becomes 0{u) if u is added to M (regardless 
of whether u is really added to M or not). 

Let c(u) be the amount charged to an element a. By the above discussion we clearly have: 

EHM)] = 1/2 1 m 

E[m(iV')]+E ueE E[c(rx)] p/2 

To complete the proof we upper bound the charge to every element u G E. Let us fix all history up to 
the point after u is processed. I f u f f{ fl N' at this point, then, by definition, c(u) = 0. Otherwise, from 
this point on, till u leaves H fl N’, every arriving element v! can have one of two behaviors: 

• v! does not cause a charge to be added to u. 

• u' causes u to be charged by (p/2) • w(u') < (p/2) • w(u) (the inequality holds because Algorithmic] 
considers elements in a non-increasing weights order). In this case, with probability 1/2, u' is added 
to M and u is removed from II 0 N' (and therefore, will not be charged again in the future). 

From the above two options we learn that when u gets into HC\N', c{u) is upper bounded by (p/2)-w(u)-X, 
where X is distributed according to the geometric distribution G(l/2). Thus, in this case: 

E[c(n)] < (p/2) • w(u) ■ EpT] = p • w(u) . 


Combining both cases, we get that the following inequality always holds: 

E[c(u)] < p • E[m(A^ / n {u})] . 
Plugging the last inequality into © gives: 


E [w(M)] 


< - => 


E[m(W)] + p ■ E[m(W)] p 1+p 


E [w(M)] < E[w(JV')] 


□ 


s In fact, N' is a maximum weight independent set in N since the process creating it is equivalent to running greedy on N with 
the objective function w. 
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Corollary 4.9. E[w(Q n N)] > • E [w(M)}. 

Proof. Follows immediately from Observation 14.71 and Lemma l4~8l □ 


The bound on E [f w (Q D N)] is obtained by showing that E [w{Q n N) — f w (Q 0 N)] < q ■ E[u>(iV) — 
f w (N )] for some value q > -. We later show that this inequality always holds for q = 1, which already 
allows us to prove Theorem 11.11 However, it turns out that by exploiting some basic properties of many 
algorithms for (linear) MSP, the same inequality can be shown for smaller values of q, which leads to 
stronger competitive ratios. 

Proposition 4.10. If~E[w(Q (IN) — f w (Q FI N)} < q • E[m(iV) — f w (N)] for some value q > then 
E[/(Q n N)} > E \f w (Q n N)} > • /(OPT). 

Proof Observe that: 


E [fw(Q n N)} > E[m(Q n N))-q- E[w(N) - f w (N )] (3) 

> a{1 P +p) ■ E[m(M)] - q ■ E[m(iV)] + q ■ E [f w (N)j 

- ' EWM)1 " qp ' E| ”’ (M)1 + + qJ TTT ■ E[/ ” (M)I ' 

where the first inequality holds by the assumption of the proposition, the second by Corollary 14.91 and the 
last by Lemmata 14.3 1 and [4~6l 

Lemma |4~T1 implies the following inequalities: 


P 


E[w(M)\ = 


P 


(E[/(M)] - /(0)) > 


P 


a(l+p) a(l+p) 

-EMM)] = —(E[/(M)] - /(0)) > —E[/(M)] 


E[/(M)] - 


q- /( 0 ) 


a{ 1 +p) /J 1 +p 

and E [f w (M)\ = E[/(M)] 


Plugging these inequalities into Inequality © yields: 


E[f w (QnN)} > 


V 


E [f{M)\ - qp ■ E[/ (M)\ + qP( ;\ P) ■ E[/(M)] 


a( 1 +p) 
p{ 1 - aq( 1 + p) + aq{ 1 - p)) 


l+p 

E[/(M)] = P(1 ~ 2 ^ a) • E[/(M)] . 


a{l+p) a(l+p) 

The proposition now follows by combining the last inequality with Corollary 14.41 


□ 


The following observation proves that one can use q = 1 for every algorithm Linear. 

Observation 4.11. w(S) — f w (S ) is a monotone function of S, hence, w(Q D N) — f w (Q ft N) < w(N) — 
UN). 

Proof. Let S be an arbitrary subset of E and let u G E "\ S. We need to show that w(S + u) — f w (S + u) > 
w(S) - f w ( S ), or equivalently, w(u) > f w (S + u) — f w (S). By definition, there exists a set B C S such 
that: 

f w (S) = f(B) + w(S\B) . 

Hence, 


fw(S + u) 


min {f(A) + w((S + u)\ A)} < f(B) + w((S + u)\B) 
f{B) + w(S \B) + w(u) = f w (S) + w(u) . 


The observation now follows by rearranging the above inequality. 


□ 
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We can now prove Theorem ll.il 


Proof of Theorem 17771 By Observation 14. 111 we can plug q = 1 into Proposition 14.101 Choosing p = 
(3a) -1 < 1, the proposition implies: 

E\f(Q n AO] > ^ 3a ] 1( !~ 2/ ; 3) • /(OPT) =-—^-- • /(OPT) . 

7J - 8a(l + (3a) -1 ) M 7 24a(3a + 1) M 7 

Hence, Algorithm [2] with p = (3a) -1 is 24a(3a + 1)-competitive for the matroid Ai. □ 

Note that p = (3a) -1 used in the above proof is not the minimizer of the expression p(l — 2pa)/(8a(l + 
p)) obtained from Proposition 14. 101 by setting q = 1. We use p = (3a) -1 for simplicity since it leads to a 
clean expression that is very close to the one obtained by the minimizing p. 

In some cases it is possible to use a smaller value of q in Proposition 14.101 The following two claims 
prove one such case. A set function g: 2 E —> R is called normalized if g{0 ) = 0 and supermodular if 
g{A) + g(B) < g(A U B) + g(A n B). 

Lemma 4.12. Let g: 2 E —> R + be a normalized, monotone and supermodular function. Denote by A(q) a 
random subset of A where each element appears with probability at most q (not necessarily independently). 
Then, E[g(A(q))] < q ■ g(A). 

Proof. Let A = {ii \,..., u\a } be an arbitrary numbering of the elements in A, and for i € {0, ...,|A|}we 
define Aj = {ui,..., //.,}, where A$ = 0. Denote by X, an indicator for the event that m € A(q), and let 
qi = Pr[rq € A(q)} = E[X ?; ] < q. Then: 



\A\ 


\A\ 

E[g(A(q))} = E 

g( 0) + ^2 x i- g( u i 1 A -1 n A (q )) 

i= 1 

< E 

g ( 0 ) + ^2 x i ■ g(ui | Ai-i) 

i=\ 


\A\ 

\A\ 



= ff( 0 ) + qi ' g ( Ui I Ai ~ i) - + Q ' g ( Ui I Ai ^> 

i =1 i =1 

= (!-?)■ 9(0) + q ■ g(A ) = q ■ g(A) , 

where there first inequality follows from the supermodularity of g, the second one from monotonicity and 
the fact that q* < q for 1 < i < |A|, and the last equality follows by the fact that g is normalized. □ 

Corollary 4.13. If Linear is guaranteed to pick every element of its input with probability at most q, then 
EMQ n N) - f w (Q Cl N)] < q ■ E[w(N) - f w (N)}. 

Proof. Let us define the function g(S) = w(S) + f w (0) — f w (S). Since w(S) is linear, f w (0) is a constant 
and f w (S) is submodular, g(S) is a supermodular function. Additionally, g(S) is normalized by construction 
and monotone by Observation 14.111 Hence, for every fixed N, by Lcmnia [4. 1 21 and the assumption that no 
element of N belongs to Q with probability larger than q: 

E (g(Q 0 N)\<q- g(N ) . 

Since the last inequality holds for every fixed N, it holds also, in expectation, without fixing N. Thus: 

E(w(Q n N) - f w (Q n N)] = E[g(Q n N)] - f w (0) 

< q ■ Eb(iV)] - f w (0) < q ■ E[w(N) - f w (N )] , 

where the last inequality holds since f w is non-negative by Property 12.21 □ 
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We can now prove Theorem ll.4l 

Proof of Theorem \L4\ Observe that q must be at least 1/a since any a-compctitive algorithm for MSP must 
be able to select an element with probability at least 1 /a when this element is the only element having a 
non-zero weight. Hence, by Corollary 14. 131 we can plug q into Proposition 14.101 Letting (3 = aq > 1 and 
choosing p = (3/3)” 1 < 1, the proposition implies: 


4(1-2 qa±) 4( 1-2/3) q 

w(Q fl N) > ^^ -— • /(OPT) = —— -f -—— • /(OPT) . 


8 a (! + ^) 8§(1 + ^) 


24/5(3/5 + 1) 


Hence, Algorithm [2] with p = (3/3 1 = (3 aq) 1 has a competitive ratio of at most: 

24/3(3/3 + 1) = 24aq(3ag + 1) = _ ^ 


for the matroid A4. 


□ 


4.4 Proof of Theorem 11.21 

In this section we assume A4 is a matroid for which Theorem 11.21 is meaniningful. More specifically, the 
behavior of Linear on A4 can be characterized as follows. 

• For every MSP instance over the matroid A4, there exists a set of k (correlated) random sets {Pi}/ =1 
such that each set Pi is always independent in A4 and E[m(lJ- =1 P,)] > u;(()PT,„)/q, where w is 
the weight function of the MSP instance and OPT,,, is the maximum weight independent set given 
w. 

• Linear outputs a uniformly random set from { P ,}( = ( . 

For every 1 < i < k, let Q, be the set P, corresponding to the execution of Linear within Algorithm [2] 
withp = (3 q) _1 . 

Lemma 4.14. E[/(UiU Qi O N)] > ' /(OPT). 

Proof One can verify that the fact that Q is produced by Linear was used in the above analysis of the 
competitive ratio of Algorithm [2] only to justify the inequality: E['«;((/)] > E • E['U)(OFT, /; (A r U A/))]. 
Thus, the proof of Theorem l 1.1 l ean be viewed as showing that after executing Algorithm [2] with p = (3o:p 1 
every random subset S of N U Nq obeying this inequality must obey also: 

E[/<SnN)la 2M3bT)' /(OPT) ' 

The lemma now follows since (jf=i Qi is a random subset of N IJ A/ obeying the required inequality. □ 

The next lemma is necessary to transform the bound given by the last lemma into a bound on the values 
of the separate sets Qi. 

Lemma 4.15. Let g : 2 E —> R + be a non-negative submodular function and let S\, S). C E. Then, 


'Ef(s i )>fl\Js i 


i= 1 


\i=l 
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Proof. We prove the result by induction on k. It clearly holds for k = 1. For k > 1: 


k 


k 


k 


£ /(#) = /(5r) + f(S 2 ) + £ /($) > /(S'! U S 2 ) + /(Si n S 2 ) + /($) 


2 — 1 



where the first inequality holds by the submodularity of /, the second by the non-negativity of / and the last 


by the induction hypothesis. 


□ 


We are now ready to prove Theorem 1 1.21 


Proof of Theorem [7721 Combining Lemmata 14. 14l and l4.151 we get: 


k 


k 


l 


^E[/(Qi n iV)] > E / I (J Qi n AT 


> 


24a(3a + 1) 


•/(OPT) . 


2—1 


, 2=1 


Since the output set Q n N of Algorithm |2] is a uniformly random subset from {Q, n AT}* =1 , we get: 



Hence, Algorithm [2] with p = (3a) 1 is 24 A; a (3a + 1)-competitive for the matroid M.. 


□ 
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A Monotone Functions 

In this section we prove the results for MSMSP that apply to general matroids, /.<?., Theorem 11.51 and 
Theorem 11.61 The online algorithm we use to prove these results is given as Algorithm [3j which is a close 
variant of Algorithm Q] Just like Algorithm [H this algorithm has a probability parameter p, but the role of 
this parameter in the algorithm is somewhat different. 

Like in Algorithm [Q it is easy to show that L contains every element of E with probability p, indepen¬ 
dently. For the analysis of Algorithm [3j we again need an equivalent offline algorithm given as Algorithm @J 
which is a close variant of Algorithm |2] 
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Algorithm 3: Monotone-Online(p) 

// Learning Phase 

1 Choose X from the binomial distribution B(n,p). 

2 Observe (and reject) the first X elements of the input. Let L be the set of these elements. 
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// Selection Phase 

Let M be the output of Greedy on the set L. 

Let IV 0. 

for each arriving element u € E \ L do 
Let w(u) -t— 0. 

if u is accepted by Greedy when applied to M + u then 

Add u to N. 

Let M u C M be the solution of Greedy immediately before it adds u to it. 
Update w(u) 4— f(u\ M u ). 


it 


Pass u to Linear with weight w(u). 


12 return Q n N, where Q is the output of Linear. 


A similar proof to the one given in Appendix Icl can be used to show that Algorithms [3] and [4] share the 
same joint distribution of the set M and the output set. 

A. 1 Analysis of Algorithms 0] and 0 

Throughout this section we fix an arbitrary matroid M = (E. 1) for which Linear is a-competitive and 
analyze the approximation ratio of Algorithm 0 Since Algorithms [3] and 0] share their outputs distribution, 
the approximation ratio we prove for Algorithm 0] implies an identical competitive ratio for Algorithm 0] 
The analysis closely follows the proof given in Section 0] and we mostly explain in this section how to 
modify the proof of Section0]to fit Algorithm 0] 

Let us start with the first stage in which we bound E [w(M)\ and E\f u ,(M)\. Lemma l4~Tl which ties both 
values to E [f(M)\, still holds, and thus, we only need to bound E[/(M)]. 

Lemma A.l. E[/(M)] > (p/2) ■ /(OPT). 

Proof We prove the corollary for the set M of Algorithm 01 which is fine since it has the same distribution 
as the set M of Algorithm0] Algorithm0]calculates M by applying Greedy to L. Greedy is known to have 
an approximation ratio of 1/2 for the problem of maximizing a non-negative monotone submodular function 
subject to a matroid constraint lfl8l . Hence, f(M ) > /(OPT(L))/2, where OPT(L) is the independent 
subset of L maximizing /. 

On the other hand, L contains every element of E with probability p, independently. Hence, 

trn > mxnm > £ (p/2) . /(0PT) _ 

where the second inequality holds since OPT 0 L is an independent set of A4 and the last inequality holds 
by Lemma 1231 □ 

This completes the first stage of the proof. In the second stage we bound E[m(A r )] and E[/„ ; (A r )]. In 
Section 0] the bound on E[m(A r )] is given by Lemma 031 The proof of this lemma uses the probability 
that an element u processed by Algorithm 0] is added to M (respectively, N) given that M + u € I and 
f(u | M) > 0. Let us denote this probability by pM (respectively, py). Using this terminology the 
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Algorithm 4: Monotone-Simulated(p) 

// Initialization 

1 Let M, N, N 0 <- 0. 

2 Let E' <r- E. 

// Main Loop 

while E' / 0 do 

Let u be the element of E' maximizing f(u \ M), and remove u from E'. 

Let w(u) <— f(u | M). 
if M + u is independent in M then 

with probability p do Add u to M. otherwise Add u to N. 

else 

Let w(u) <— 0. 

Add u to No with probability 1 — p. 

it Run Linear with N U A r 0 as the input (in a uniformly random order) and the weights defined by w. 
12 return Q fi N, where Q is the output of Linear. 


3 

4 

5 

6 

7 

8 
9 

10 


proof actually shows E[in(iV)] = ( Pn/pm) ■ E [w(M)\. It can be verified that the proof holds also for 
Algorithm [4[ as long as one uses the values pm and pjy corresponding to this algorithm (which are p and 
1 — p, respectively). Hence, we get the following lemma. 

Lemma A.2. E[m(iV)] = ■ E[io(M)]. 

Similarly, using the potential function &(M,N) = f w (N) — - • f w (M U N), the proof of 
Lemma gS] can be used to show that E[/^(iV)] > P ^+ PN ' f( 0 ) + P m+pn ' u N )] in both 

Algorithms [2] and 0] This gives the following lemma. 

Lemma A.3. E [f w {N)} > p ■ f{0) + (1 - p) • E [f w (M)\. 

Proof By the above discussion: 

nf w m > -- • f(0) + 1 ~ P ■ E [f w (M U N)} 

p+(l-p) p+{l-p) 

= P ■ f(0) + (!-*>)■ E[/ w (M U N)} . 


The lemma now follows since f w is monotone by Property 12.21 □ 

The last lemma bounds E[/m(iV)], and thus, completes the second stage of the proof. In the third 
stage we bound E[u>(Q D N)] and E [f w (Q (~l N)]. Observation 14.71 bounds E [w(Q n N)] in terms of 
E w.TOPT^.fiV))], and this observation holds also for Algorithm!!] Hence, to get a bound on E \w(Q n N)] 
we first need a bound on E;u;(()PT„ ; (A r ))]. The following lemma corresponds to Lemma l4~8l 

Lemma A.4. E[w(OPT w (N))] > (1 - p) ■ E[w(M)]. 

Proof The proof of Lemma 14.81 shows that there exist a random independent set N' C N and random 
variables {c(u)} u€ e such that: 


_ E[w(M)] __ pm_ 

E[m(A^')] + Y,u&e E [ c («)] PN 


and E[c(it)l < — • E[w(N' n {«})] . 
Pm 
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It can be verified that this proof holds also for Algorithm [4j and thus, plugging in the corresponding values 
of pm and p^, we get: 


E [w(M)] 


V 


EKW0] + £„ 6E E[c(u)] 1 -p 


and 


E[c(it)] < 


1-p 


R[w(N' n {«})] 


Combining the two equations gives: 


E [w(M)] p 

“ 1-P 


=> (1-p)- E \w(M)] < E[w(N’)] • 


The lemma now follows by observing that N' is a possible candidate to be OPT w (N), and therefore, 

w(OPT w (N)) > w(N'). □ 

Corollary A.5. E [w(Q n N)} > ^ • E [w(M)]. 

Proof. Follows immediately from Observation 14.71 and Lcninia [A~4l □ 

The bound on E [f w (Q fl N )] is obtained from the bound on R[w(Q Cl N )] by showing that R[w(Q n 
N) — f w (Q fl N )] < q • E[m(A’) — f w (N)] for some value q > 1/a. However, like in Section|4j we first 
prove the approximation ratio induced by every value of q and only then discuss the values that q can take. 

Proposition A.6. If~E[w(Q fl N) — f w (Q 0 N )] < q ■ E[w(iV) — f w (N)] for some value q > 1/a, then 
E [f(Q n N)] > E [f w (Q n N)] > 9(1 - p)(p /^ g )- 1+p ) • /(OPT). 

Proof. Observe that: 


E [f w (Q n N)] > E [w(Q 0 N)} — q ■ E[m(A r ) - f w (N)] (4) 

> — • E [w(M)] - q ■ R[w(N)] + q • E[f w (N)] 

a 

> • EKM)] - ■ EKM)] + q( 1 - p) ■ E [f w (M)\ , 

where the first inequality holds by the assumption of the proposition, the second by Corollary IA.5I and the 
last by Lemmata lA.2l and lA.3l and the non-negativity of /. 

Lemma |4~TI implies the following inequalities: 

— • nw(M)] = ^ ■ (E[/(M)] - /(0)) > ■ E[/(M)] - g (l - p) ■ /(0) , 

a a a 

_9(1_P) . EKM)] = . (E[/(M)] - /(0)) > ■ E[/(M)] + g(l - p ) • /(0) 

and 

q( 1 ~ P ) • HUM)} = 9(1 - p) • E [f(M)\ . 


Plugging these inequalities into Inequality (0]) yields: 


nUQ n N)} > ■ E[/(M)] - g(1 p) • E [f(M)\ + 9(1 - p ) • E [f(M)\ 


a p 

(1 ~p)(p/a - q + qp ) 


mm = ^-pXp/^-i+p) . mm 


p p 

The proposition now follows by combining the last inequality with Lemma lATTI 


□ 
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Recall that Observation 14.111 showed in Section [4] that one can always choose q = 1. This proof of this 
observation holds also for Algorithm [4] and thus, we get the following corollary. 

Corollary A.7. For p = -P-]-pL, the approximation ratio of Algorithm's at most 8 a (cc + 1). 

Proof. By Observation ^. Ill we can plug q = 1 into Proposition IA. 61 Hence, for p = the proposition 

implies: 


w(Q n N) > 


2ot+l \ ( 2ot+l _ 1 , 2a+l \ 

2(a+l)y y2a(q+l) ^ 2(a+l )) ^ QpT) 


1 / 2a+l 

2(a+l) ' y2a(a+l) 

— 


2(a+l) 


• /(OPT) 


1 

8 a(a + 1) 


•/(OPT) . 


□ 


Corollary IA.7I proves the first part of Theorem 11.51 corresponding to Theorem 11.11 The proof can be 
extended to prove the two other parts of the theorem in the same way this is done in Section |4] 

From this point on we assume Linear has, with respect to the matroid Ad, the properties guaranteed by 
Theorem 11.61 In other words, for every MSP instance over the matroid Ad there exists a random set S C E 
obeying: 

• S is always an optimal solution of the MSP instance. 

• For every element u £ E, Pr[u is selected by Linear] > - • Pr [u £ 5]. 

By removing appropriately chosen elements from the output of Linear one can get a new algorithm 
Linear ' which still has the above properties, but selects no element with probability larger than 1 /a. Clearly 
such an algorithm always exists although it might be non-efficient and offline, which is fine since we use 
Linear' only for analysis purposes. The following observation shows that Linear' must be a-competitive, 
and thus, all the results proved above hold for it. 


Observation A.8. An algorithm having the properties guaranteed by Theorem 17. 61 with respect to a matroid 
Ad is a-competitive for MSP over this matroid. 


Proof. Let T be the random output set of the algorithm given an MSP instance over M with a weight 
function w'. Then: 

E[u/(T)1 = V w'(u) ■ Pr [u £ T] > - ■ V w\u) ■ Pr[n € S] = - • E [w\S)} . 

u&E uClE 


The observation now follows since S is always an optimal solution for the MSP instance, and thus, w'(S) 
is always equal to the value of such an optimal solution. □ 

We are now ready to prove Theorem [L6] 

Proof of Theorem \L6\ Let Q and Q' be the sets produced by Linear and Linear' when these algorithms are 
placed in Algorithm[4l By construction, Q D Q', and thus, by the monotonicity of /: 


E [/ (Q n N)} > E[/ (Q' n N)} . (5) 

Consider now Corollary 14. 131 It can be verified that this corollary holds also for Algorithm 01 and thus, 
E[ru(Q' n N ) — f w (Q' fl N )] < P • E[m(A' r ) — f w (N)\. Plugging this inequality into Proposition |AA] gives: 

E[/(Q'n«)] > (1-P)(«PA»-1+P), /(0PX) = V-P)(2p-1 ). /(0PT) 

2a 2a 
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Choosing p = 3/4 now gives: 


E [f(Q' n N)] > (1 3/4 ] (3/2 • /(OPT) = -i- • /(OPT) . 

2a loa 

Combing the last inequality with Inequality © proves that Algorithm |4] with Linear and p = 3/4 is 16a- 
competitive for MSMSP over M.. □ 

B Results for Specific Matroids 

In this appendix we explain how to get the improved competitive ratios stated in Section 11.11 for specific 
classes of matroids. For transversal matroids and ^-sparse linear matroids the stated competitive ratios 
were obtained through an application of Theorem 11.41 More precisely, for transversal matroids we use 
Theorem 1 1.41 with the MSP algorithm of Korula and Pal lf27l . which is 8-competitive and picks no element 
with probability larger than 1/2. 

For /r-sparsc linear matroids we use Soto’s /re-competitive procedure li36)l . To get a stronger result than 
what we would get through an application of Theorem 11.11 we observe that we can assume that Soto’s 
algorithm selects no element with probability larger than q = 1/e. Soto’s algorithm reduces the problem 
to the classical matroid secretary problem, by losing a factor of k. The classical e-competitive algorithm of 
Dynkin can easily be adapted such that no element is chosen with probability more than 1/e, maintaining 
e-competitiveness of the algorithm. More precisely, Dynkin’s procedure selects the heaviest element with 
some probability p(n) > 1/e that only depends on the size n = \E\ of the ground set and can be calculated 
upfront. Moreover, no element is selected with a probability exceeding p(n). We can now modify Dynkin’s 
algorithm as follows, to obtain an algorithm that is still e-competitive and selects no element with probability 
larger than 1/e: whenever Dynkin’s algorithm would select an element u, we will toss a coin and only select 
it with probability l/(e • p(n )). Hence, using Soto’s algorithm with this modification of Dynkin’s procedure, 
we have a fce-competitive algorithm for MSP over fc-sparse linear matroids that selects no element with 
probability larger than 1/e. The competitive ratios claimed in Section 11.11 now follow from Theorem 11.41 
(for SMSP) and Theorem 1 1.51 (for MSMSP). 

Unitary partition matroids have a trivial e-competitive algorithm applying the classical secretary algo¬ 
rithm to every elements class separately. However, this algorithm has two weaknesses. First, it requires 
prior knowledge about the number of elements in each class, which we sometimes want to avoid. Second, it 
might select an element with a probability significantly larger than 1/e, which prevents an effective use of 
Theorem 1 1.4 1 In Section IbTTI we present a simple algorithm avoiding these issues. Plugging this algorithm 
into Theorem 1 1.4 1 yields the guaranteed competitive ratio for SMSP over unitary partition matroids. 

Ma et al. ll32l give an algorithm for MSP over laminar matroids selecting every element of OPT with 
probability at least 1 / 9.0 and no element with probability larger than 0.158. Plugging this algorithm as Linear 
into Theorems 11.41 and 11.61 results in competitive ratios of 1279 and 154 for SMSP and MSMSP over 
laminar matroids, respectively]^] This already improves over the result of ll32l for MSMSP and provides the 
first 0(l)-competitive algorithm for SMSP. However, by a careful combination of our ideas and a lemma 
from lf32l one can prove the stronger ratios stated in Section ITTT1 The details can be found in Section [B~2l 

B.l Algorithm for MSP over Unitary Partition Matroids 

In a unitary partition matroid the elements are partitioned into k disjoint non-empty classes { G, }/ = ,. A set S 
is independent in the matroid if and only if it contains at most one element of every class G t . Algorithm[5]is 
an algorithm for MSP over unitary partition matroids. Observe that this algorithm can be implemented with 

9 The result for SMSP can be improved to 1196 by optimizing the parameter of the algorithm of 03- 
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no access to the matroid besides of the independence oracle. As usual, the algorithm assumes all weights 
are non-negative and disjoint. This assumption is without loss of generality since negative weights can be 
replaced with zero weights (as long as elements of negative weights are rejected even if the algorithm tries 
to accept them) and ties between weights can be broken arbitrarily. 


Algorithm 5: Partition-MSP(ro) 

// Learning Phase 
t Let t <r- \n/e\. 

2 with probability t — n/e do Let X 4— t — 1. otherwise Let X <— t. Observe (and reject) the first X 
elements of the input. Let L be the set of these elements. 
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// Selection Phase 
for each arriving element u € E \ L do 
Let G u be the class of u. 
if G u has not been marked previously then 
if G u 0 L / 0 then 

if w(u) > max u / g Gg n L w{v!) then 
Mark G u . 

Accept u. 


to 

n 

12 


Mark G„ . 

Accept u with probability X/\L\. 


13 


Add u to L. 


Informally, Algorithm [5] observes (and discards) the X ps n/e first elements. Then, for every arriving 
element u the algorithms makes the following decisions. 

• If it’s class has been marked by a previous element, then u is simply discarded. A marked class is a 
class from which we should accept no new elements. A class is usually marked because some element 
of it has already been accepted, but not always. 

• Otherwise, if some elements of u’ s class have already arrived and u is better than all of them, then u 
is accepted and its class is marked. 

• Finally, if no element of u's class has previously arrived, then its class is marked and with some 
probability u is accepted. This is the only case in which a class can get marked despite the fact that 
no element of it has been accepted to the solution. 

Observation B.l. Algorithm\5\always produces an independent set. 

Proof. Once an element of a class G is accepted, the class G becomes marked and the algorithm dismisses 
immediately any future elements of this class. □ 

For every element u E E, let L u be the set L before u is processed, /.<?., it is the set of elements that 
arrived before u. Additionally, for every class Gi, let it* * be the element with the largest weight in G, (which 
is also the element of Gi in the optimal solution). 

Lemma B.2. For every 1 < i < k, u* is accepted with probability ^ n — \ + y"_rt,/»n 77 ■ 

Proof. Fix the variable X and the set L u * (and thus, also the location of u* in the input), and let us calculate 
the probability u* is accepted. If \L U *\ < X, then u* arrives during the learning phase (/. e ., it is one of the 
first X elements), and thus, is it not accepted. If | L u * \ > X , then there are three cases. 
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• The first case is when Gi D L u * = 0. In this case u* is the first element of G t encountered by 
Algorithm^ Since it is encountered after the learning phase (i.e., it is not one of the first X elements), 
it is accepted with probability X/\L U *\. 

• The second case is when Gi D L u * f 0 and the element u, with the maximum weight in G-, D L u * 
arrives during the learning phase. In this case every element of Gi n L u * arriving after the learning 
phase fails the condition on Line [7] of Algorithm [5] Thus, G t is still unmarked when u* anives, and 
thus, it is accepted. Since the order of the elements in L u * is uniformly random, this case happens 
with probability X/\L U * | conditioned on X and L u *. 

• The final case is when Gi n L u * f 0 and the element Ui with the maximum weight in 6/ n L u * arrives 
after the learning phase. In this case, either Gi is already marked when u t anives or Gi is marked 
when Ui is processed. Either way, when u* anives Gi is already marked, and thus, u* is dismissed. 

The above cases imply: 

f i i if I L u * A A 

Pr[n* is accepted | X, L u *\ = < “P 

0 otherwise . 


Regardless of X, the arrival time of u* is distributed uniformly between 1 and n. Hence, 


1 \ 

Pr[n* is accepted | X\ = — ■ - 

The law of total probability now gives: 


X_ = y^X_ 

n x j- 1 ~h J x nj 


n —1 1 n —1 

Pr[< is accepted] = (* - f] • E ~ + (' “ * + f) ' E ^ 

j=t~ 1 J j=t J 


= 0-0 + 

n e 




i _ I+v - - ^ n/e ^ _ I n_1 

n~e + ^ t ej~ n e .^ej 


n— 1 n 

+ E n 


n— 1 ^ 

E- 

ni 
j=t J 


□ 


For ease of notation, let us denote a(n) = (jn/e]/n — e 1 + (ej) 1 j . Lemma lB.2l shows 

that every element of OPT is accepted by Algorithm [5] with probability l/a(n), and thus, Algorithm [5] is 
a(n)-conipctitivc for MSP over unitary partition matroids. 

Observation B.3. For every n > 1, a(ri) < e. 


Proof. 


71-1 

E 

i-W.i e] e 

r dx i 

f n dx 

r\n/ e 1 

dx 




/ - — — ’ 

/ — 

- / 

— 




J |~n/e"| ^ ^ 

Jn/e ^ 

J n/e 

X 




1 

> - 

e 

n in [ [n/el dx 

[111 x\ |- n/e] / — = 

Jn e 11 

1 n/e 

- + 

e 

— |~n/e] 2 

n e 

\n/e] 

n 

□ 


The next lemma completes the analysis of Algorithm [5J 
Lemma B.4. No element is accepted with probability larger than l/a(n ) by Algorithm^ 
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Proof. Fix an arbitrary element a of a class G t . For every arrival order tv of the elements, let us define a (tv) 
as the order obtained from tv by replacing u and u*. Observe that cr is a bijection. 

Consider now an arbitrary order tv given which Algorithm [5] accepts u with probability p > 0. Clearly 
u proceeds u* in tv because otherwise w(u) could not be larger than the weights or all elements appearing 
earlier in tv, which is a necessary condition for u to be accepted. Thus, Algorithm [5] is in the same state 
when processing u given tv and u* given a (tv). Since w(u*) > w(u), this implies that Algorithm [5] accepts 
u* with probability of at least p given the order a (tv). 

Recalling that cr is a bijection, we now get: 


p ^ ^ ^ Pr[tt i s accepted given tv] > ^ Pr[u* is accepted given a( tv)] 


n\ 


E Prkr* is accepted given tv] „ , * . 1 

—--y---- = Pr[it* is accepted] = 


n! 


a(n) 


□ 


Remark: The fact that Algorithm [5] is a(n)-competitive while accepting no element with a probability 
larger than l/a(n) is sufficient to extract all the power of Theorem 1 1 .41 and get the result stated in the table 
in Section 11.11 Flowever, sometimes it is useful to have an algorithm that is e-competitive and accepts no 
element with a probability larger than 1/e. This can be achieved by executing Algorithm [5] and accepting 
with probability a(n)/e every element accepted by Algorithm!!] 


B.2 Results for Laminar Matroids 

In this section we prove the competitive ratios stated in Section [ITT] for laminar matroids. Throughout this 
section we assume Ad is a laminar matroid. Ma et al. lf32ll consider an algorithm which constructs the sets 
M and N in the same way this is done by Algorithm 0] For this algorithm, and thus, also for Algorithm!!] 
they proved the following lemma (assuming Ad is laminar). 

Lemma B.5 (Rephrased version of Lemma 10 of |f32l ). Consider Algorithm @ with Linear being the pro¬ 
cedure that accepts every element of N whose acceptance does not violate the independence of the current 
solution. Then 

EKQ)] > (i - jfrpf) ■ E K")] . 

where /3 = 2e(l — p). 

The last lemma can be used instead of Corollary IA.5l in the analysis of Algorithm!!] This allows us to 
prove the result for MS MSP. 

Corollary B.6. There exists a lAA-competitive algorithm for MSMSP over laminar matroids. 

Proof. Consider Algorithm fusing the algorithm Linear defined by Lemma HOI By Lemma I b 31 we get 

E[w(Q n N)] = EMQ)] > (l - ^^ 3 ) • E[w(N)], 

where [3 = 2e(l — p). Thus, 

E [.UQ n AT)] > E[w(Q n N)] - E[w(N)] + E [f w (N)] (6) 

> (i - (1^)3) • - E[m(iV)] + (1 - p) . E[f w (M)] 

= (1 - p) ■ HUM)] - -^^3 • E [w(N)] , 
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where the first inequality holds by Observation 14. 11 l and the second by Lemmata IA. 3 1 and the non-negativity 
of /. 

Recall that, by Lemmata 14. ll and fA.21 f w (M ) = f(M) and 

EKiV)] = ^ • E [w(M)\ = ^ • E[/(M) - /(0)] < l -^- ■ E[/(M)] . 

Plugging these inequalities into Inequality © yields: 


ELMQniV)] > 
> 

> 


(l-p)-E[/(M)] 

p / _ 2/3 l-p \ 

2 \ P (1-/3) 3 ' P ) 
P (, 4e(l — p) 

2 V P (1 — 2e(l — p)) 3 


• E[m(IV)] 

—-E[/(M)] 

P 

• E[/(OPT)] 

• E[/(OPT)], 


where the last inequality follows from E[/(M)] > §/(OPT), as stated by Lemma I aTTI The above expres¬ 
sion is maximized for p ~ 0.976299 leading to the claimed competitive ratio. □ 


To prove the result for SMSP, we first need an observation about Algorithm |2] Consider Algorithm |6] 


Algorithm 6: Middle-Algorithm(p) 

// Initialization 
t Let M,N <- 0. 

2 Let E' <— E. 


3 

4 

5 

6 

7 

8 


// Main Loop 

while E' / 0 do 

Let u be the element of E' maximizing f(u \ M), and remove u from E'. 

Let w(u) 4— /(tt | M ). 
with probability (1 + p)/2 do 

if M + u is independent in A4 and w(u) > 0 then 

with probability 1/(1 + p) do Add u to M. otherwise with probability p/(l + p) do 
_ Add it to IV. 


Observe that every processed element obeying the conditions on Line [7] of the Algorithm [6] is added by 
both Algorithms [2] and Algorithm [6] with probability 1/2 to M and with probability p/2 to N. Hence, both 
algorithms produce the same joint distribution of M and N. On the other hand, in terms of the sets M and 
N Algorithm |6] is identical to Algorithm |4] up to two modifications: 

• The value p in Algorithm |4]is replaced with 1/(1 + p). 

• Algorithm [6] dismisses some elements without processing them. These elements include all the ele¬ 
ments of negative weights and every other element with probability 1 — (1 + p)/2. 

The second modification does not affect the proof of f32il for Lemma lBAl tin fact, the proof requires elements 
of negative weight to be dismissed). Hence, we get the following lemma which is obtained from Lemma lBAl 
by replacing p with 1/(1 + p). 


27 






























Lemma B.7. Consider Algorithm\2\with Linear being the procedure that accepts every element ofN whose 
acceptance does not violate the independence of the current solution. Then 


EMQ)] > (i - -^ 3 ) 


EHiV)] , 


where [3 = 2e(l — 1/(1 + p)). 

The last lemma can be used instead of Corollary 14.91 in the analysis of Algorithm |2j This allows us to 
prove the result for SMSP. 

Corollary B.8. There exists a 585- competitive algorithm for SMSP over laminar matroids. 

Proof Consider Algorith mousing the algorithm Linear defined by Lemma IfTTl By Lemma IET71 we obtain 

E[w(Q n N)} = EHQ)] > (1 - 

where (3 = 2e(l — 1/(1 + p))- Thus, 

E [f w (Q n N)} > E[w(Q n N)] - E[w(N)] + E[f w (N)] (7) 

> (1 - (EEja) • E[«j(iV)] - E[«.(JV)1 + V f-A . E[/„(Af)] 

= ■ E I/«(")] - (T^s ■ E I»( N )1 . 

where the first inequality holds by Observation 14. 11 1 and the second by Lemmata 14. 6 1 and the non-negativity 
of/. 

Recall that by Lemmata 14.11 and 1431 f w (M ) = f(M) and 


E[u;(JV)] = p ■ E[w(M)] = p ■ E {f(M) - /( 0 )] < p ■ E[/(Af)] . 


Plugging these inequalities into Inequality © yields: 


E [UQ 0 IV)] > . E[f w (M)} - J-VL- ■ E[w(N)] 

1 +p (1 — p) J 


>p( 

1 — p 

J+p 

>?l 

(1 -v 

- 8 1 

U +p 

~ P \ 

(1 -v 

8 1 

+p 


irw) ■ E [/(° PT )] 

4e(l — 1/(1 + p)) \ 

(1 — 2 e(l — 1/(1 + p))) 3 ) 


• E[/(OPT)], 


where the last inequality follows by E[f(M)] > /(OPT )/8 which holds by Corollary 14.41 The above 
expression is maximized for p ~ 0.023769, leading to the claimed competitive ratio. □ 


C Equivalence of Algorithms U and |2] 

In this section we prove that Algorithms Q] and [2] share the joint distribution of the set M and the output set. 
First, for completeness, let us state the standard algorithm Greedy as Algorithm [7] Recall that Greedy is 


28 






















Algorithm 7: Greedy 

// Initialization 

1 Let M <- 0. 

2 Let E' t— E. 


3 

4 

5 

6 


// Main Loop 

while E' / 0 do 

Let u be the element of E' maximizing f(u \ M), and remove u from E'. 
if A/ + u is independent in A4 and f(u\M)>0 then 
Add u to M. 


7 return M. 


used by Algorithm [TJ and observe that, as mentioned in Section [3j the solution M of Greedy starts as the 
empty set and elements are added to it one by one. 

Let L and F be two independent random subsets of E, where the set L (respectively, F) contains 
every element u € E with probability 1/2 (respectively, p ), independently. By Observation 13.31 the set L 
constructed by Algorithm |T] has exactly the same distribution as the above set L. Hence, given access to the 
above sets L and F, Algorithm [Dean be rewritten as Algorithm [s] Notice that the only changes made during 
the rewrite are omission of the calculation of L and replacement of the random coin toss on Line [ 6 ] with a 
reference to F. 


Algorithm 8: Coupled Online(L, F) 

t Let M be the output of Greedy on the set L. 

2 Let N <r- 0 . 

3 for each arriving element a £ E \ L do 

4 Let w{u) <— 0. 

5 if u is accepted by Greedy when applied to M + u then 

6 if u € F then 

7 Add u to N. 

8 Let M u C M be the solution of Greedy immediately before it adds u to it. 

9 Update w(u) E- f(u \ M u ). 

to Pass u to Linear with weight w(u). 
it return Q n N, where Q is the output of Linear. 


Similarly, Algorithm [2] can be rewritten using L and F as Algorithm [9] In this case the coin tosses on 
Lines 171 and fl2lhave been replaced with references to L, and the coin toss on Line|7]has been replaced with 
a reference to F. It is important to note that an element is never checked at both Lines [7] and [[2l and thus, 
the independence of the randomness used by these lines is preserved. 

In the rest of this section we show that conditioned on L and F, Algorithms [ 8 ] and [9] produce the same 
set M and have the same output distribution. Hence, the original Algorithm [2]has the same joint distribution 
of the set M and the output set as Algorithm Q] Let us start the proof with the following simple observation. 

Observation C.l. The elements passed to Linear by both Algorithms\8\and\9\are exactly the elements of 
E\L. Moreover, the elements of E\ (L U N) are all assigned 0 weights. 

Next, observe that Algorithm [9] obtains M by running a modified version of Greedy on the elements of 


29 












Algorithm 9: Coupled Simulated(L, F) 

// Initialization 

1 Let M, N, N 0 <- 0 . 

2 Let E' <— E. 


3 

4 

5 

6 

7 

8 
9 


// Main Loop 

while E' / 0 do 

Let u be the element of E' maximizing f(u 
Let w(u) ■<— f{u | M). 
if M + u is independent in Ad and w(u ) > 
if u € L then Add u to M. else if a € 
Update w{u) <— 0. 

Add u to iY(j. 


| M), and remove u from E'. 

0 then 

F then Add u to N. else 


to 

it 

12 


else 


Let w(u) <— 0. 

if u 0 L then Add u to Nq. 


13 Run Linear with A r U A r 0 as the input (in a uniformly random order) and the weights defined by w. 

14 return Q d N, where Q is the output of Linear. 


E. The modified version is identical to the original one, except that every time immediately before adding 
an element u € E \ L to the result, the element is “stolen” (and gets to N or JVq instead). We denote such a 
modified run of Greedy by GreedyfA, E \ L ). More generally, we use GreedyM, B) to denote a run of 
greedy on the set A, where the elements of B C A are stolen if Greedy tries to add them to its solution. 

Observation C.2. For four sets A D B and C F I) obeying GreedyiA. B) C C \ D C A \ B, the runs 
Greedy(A. B) and Greedy (G. D) behave the same in the following sense: 

• The outputs of the two runs are identical, i. e., GreedyiA. B) = GreedyiC, D). 

• The run GreedyiA. IF) attempts to add an element of Ad C to the solution if and only if the run 
Greedy(C , D) attempts to add this element, moreover, both runs attempt to add the element to the 
same partial solution. 

Proof The behavior of Greedy depends only on elements that were added to the result set. Hence, elements 
that appear in the input but are stolen before being added to the output does not affect the output of Greedy, 
and the same holds for elements that greedy does not attempt to add to the output. □ 

Corollary C.3. Conditioned on a set L, Algorithms\S\and\9\produce the same set M. 

Proof. Algorithm Q] selects M as GreedyfL, 0 ), while Algorithm [9] select M as GreedyiA. E \ L ). The 
equality between these sets follows immediately from Observation |C2] since: 

Greedy(L, 0 ) c L = E\(E\L) . □ 

Corollary C.4. Conditioned on sets L and F, Algorithms \8\and\9\produce the same set N and assign the 
same weights to the elements of N. 

Proof. Consider an element u € E\L. Algorithm[ 8 ]adds u to A^ if and only if u £ F and Greedy (M +u, 0 ) 
tries to add u to its solution. When this happens, u is assigned the weight f(u \ M u ), where M u is the 


30 











solution Greedy has immediately before it process u. Clearly both the weight and the membership of u in 
N does not change if Greedy(M + u, 0) is replaced with Greedy(M + u, u) in the above description. 

On the other hand. Algorithm [9] adds u to A r if and only if u £ F and G reedy (N, E\L ) tries to add u 
to its solution. When this happens, u is assigned the weight f(u \ M u ), where M u is the solution Greedy 
has immediately before it process u. By Observation IC.21 both Greedy(Af + u, u) and Greedy(S, E\L ) 
behave the same because: 

Greedy(£, E\L) = Greedy (L, 0 ) = M = (M + u) - u c L = E\(E\L) . 

Hence, Algorithms [ 8 ] and [9] make the same decisions regarding the membership and weight of u. □ 

Observation 1C. 1 I together with the last corollary shows that, conditioned on L and F, Linear is fed by the 
same set E\L of elements and weights by Algorithms [ 8 ] and [9] To complete the proof that both algorithms 
produce the same output distribution, conditioned on L and F, we just need to show that the elements of 
E\L are passed to Linear in the same distribution of orders under both algorithms. This is true by the 
following observations. 

• Algorithm[ 8 ]passes the elements of E\L in the order in which they arrive, which is uniformly random. 

• Algorithm [9]passes the elements of N U ./Vo = E \ L in a uniformly random order. 


D Missing Proofs 

In this appendix we provide the missing proofs for Proposition 13.21 and Lemma l4~2l 

D. 1 Proof of Proposition 13.21 

Proposition 13.21 Given an algorithm Li near for MSP, there exists an algorithm for Partial- MSP whose 
competitive ratio for every matroid A4 is as good as the competitive ratio of Linear for this matroid. 

The algorithm we use to prove Proposition 13.21 is Algorithm ITOl 


Algorithm 10: Algorithm for Partial-MSP 

// Initialization 

t Let L! a — L. 

2 Let r <r- n — \L\. 


3 

4 

5 

6 

7 

8 
9 

10 


// Main Loop 

while \L'\ + r > 0 do 

with probability r /(\l.'\+r) do 
Update r <— r — 1. 

Pass the next arriving element to Linear with its original weight w(u). 

otherwise 

Let u be a uniformly random element of IJ. 

Update L' 4— L' — u. 

Pass u to Linear with a weight of 0. 


it Return the output of Linear without the elements of L. 
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Observation D.l. The input passed to Linear by Algorithm \T0\ is a uniformly random permutation of E, 
where the weigh t of an element u € E is defined by: 


w'{u) 


0 ifu E L , 
w(u) otherwise . 


Proof The only part of the observation that requires a proof is the claim that the elements of E are passed 
to Linear in a uniformly random order. To see why this is true, notice that r is maintained as the number of 
input elements that have not been passed yet to Linear and L' is the set of elements of L that have not been 
passed yet to Linear. Hence, at each time point the set of elements that have not been passed yet to Linear 
is of size r + \L'\, and the algorithm passes a uniformly random element from this set. □ 

The following lemma implies Proposition 13.21 

Lemma D.2. Al go rithin 170l is an algorithm for Partial-MSP whose competitive ratio for every matroid A i 
is as good as the competitive ratio of Linear for this matroid. 

Proof Assume Linear is o-compctitive for an arbitrary matroid A 4 = (E. I). By Observation lD.il when¬ 
ever Algorithm [lO]faces an instances of Partial-MSP over the matroid Ad with a weight function w, it makes 
Linear face an instance of MSP over the same matroid with weights given by w'. Hence, if we denote by S 
the output of Linear, then, by the guarantee of Linear: 


E[u/(5)] > a ■ max w' (T) 


Recall that w' is equal to w for elements of E\L and is equal to 0 for all other elements. Hence, 

EfiofiS \ L)] = E[«/(5)l > a ■ maxw'tT) = a ■ max w'(T) = a ■ max w(T) . 

T&x Tein2E\L Texn2E\L 

This completes the proof of the lemma since S \ L is the output of Algorithm ITOl and max TgIn 2 B\i w{T) 
is the value of the optimal solution for the instance of Partial-MSP faced by Algorithm[lO] □ 


D.2 Proof of Lemma 14.21 

Lemma [4.21 For a matroid and a non-negative submodular function f, if S is the independent set returned 
by Greedy, then for any independent set C, f{S) > f(C U S)/2. 

First, we need some definitions. Let k = 151, and for every 0 < % < k let S, he the solution of Greedy 
after i elements are added to it. Additionally, for every 1 < i < k, let u t be the single element in S h \ S,-\. 
We also need to define the sets 0 recursively as follows: 

• The set Cfi is simply C. Observe that Co U ,S'o = C is independent. 

• For every 1 < % < k, the set G t is a maximal independent subset of C',_ i + Ui that contains ,5',. It is 
also useful to define C[ = C',_i \ C ,, the set of elements that appear in C',_ i but not in C,. By matroid 
properties, since Si — rq = Si -1 C Q_i and Q_i is independent, the size of C[ is at most 1. 

Let us now make a few observations regarding the above definitions. 

Observation D.3. f(Sk) > f{Ck). 

Proof Assume towards a contradiction that f(Sk ) < fiCf)- Then, by submodularity since Sk L= Ck, there 
must exist an element a G CV \ Sj. obeying f(u \ Sk) > 0- On the other hand, since Ck is independent 
we must also have that Sk + u is independent. However, the existence of an element with these properties 
contradicts the fact that Sk is the output of Greedy. □ 
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Lemma D.4. For every 1 < i < k, f(ui \ Si- 1 ) > f(C■ U Ck) — f(Ck). 

Proof If C[ = 0, then the right hand side of the inequality we want to prove is 0 while its left hand side 
is non-negative (since Greedy chose to add rq to Si- 1 ). Thus, we concentrate from this point on the case 

l^| = L 

Let u[ be the single element of C[. Since C t contains the set Si-\ and is independent, we know that u! i 
is not in Si-\ and could be added to Si-\ by Greedy without violating independence. On the other hand, 
since Greedy chose to add Ui to Si- 1 , we must have: 

f{ui | Si-!) > f(u' I Si-!) > f(u'i I C k ) = f{C[ U Ck) - f(C k ) , 

where the second inequality follows by submodularity since Si -1 C Sr C C'/,.. □ 

We arc now ready to prove Lemma l4~2l 

f(s) = f(s k ) + /(0) + J 2 j =1 f(uj | Sj-!) > f(Ck) + /(0) + Y)i=ilf( c i u Ck) - f(Ck )] 

> f(Ck) + /(0) + [/(Uti Cj u C k ) - f(Ck)] _ /(0) + f(s uc) x /(5 u C) 

- 2 2 — 2 ’ 

where the first inequality holds by Observation ID.3I and Lemma ID. 41 the second inequality holds by sub¬ 
modularity of / and the fact that the C[ are disjoint, and the last by the non-negativity of /. 
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